Enabling platforms for high - performance computational grids oriented scalable virtual organizations Workpackage 6 Knowledge Services for Intensive Data Analysis and Intelligent Query Answering WP

نویسنده

  • Franco Turini
چکیده

The activities included in this work package share the common objective of building middleware services for knowledge intensive applications and processes. Specifically, the services we intend to design and implement concern two main topics: • information extraction and knowledge discovery from structured sources (databases or data warehouses) and semi-structured ones (such as web pages or XML documents); • use of extracted information and knowledge within high performance search and query answering high performance services, both for efficiency tuning and for quality of service improvement.. The activities aim at exploiting the lower layers of the Grid architecture in order to implement the functionalities of knowledge discovery and/or to design distributed algorithms (such as data mining algorithms or search engine sub-modules) or to yield modules that can be integrated in the development environment for applications of network and Grid computing. Services that we intend to design are logically structured in three levels over the layer of basic services offered by the toolkits of the grid: Architectural services are developed within the Activity 5, Web Switching. Basic Services are developed within the Activity 1, Basic Services for Knowledge Discovery on Grids, and the activity 3, Resource Discovery and Description. Retrieval services are developed within the activity 4, Retrieval Services. Knowledge Services are developed within the activity 2, Environments and Services for Knowledge Discovery. The services of the KNOWLEDGE GRID are organized into two layers: (1) Core Knowledge-Grid layer and (2) High-level Knowledge-Grid layer. The first layer includes the data management services, by implementing them directly over the basic services offered by the toolkits of the grid. An important data management service is integrating and querying data warehouses distributed on a grid. In particular, methods and tools have been defined to enable interoperability of grid semi-structured information sources using " active " iperlinks in XML documents for service calls to support interaction among sites. In addition, the usage of extensional meta-data has been proposed to provide a synopsis (i.e., an aggregated and approximated description) of the contents of a number of grid information sources, particularly sensors producing a continuous flow of data (data stream). During the first year, we extended and deployed the High level Knowledge Grid architecture, designed a metadata-based information system and a data mining ontology we be used in the system. Knowledge services Retrieval services Basic services Architectural Services 3 The Knowledge Grid is a parallel and distributed software architecture that …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Knowledge and Data Management in Grids: Notes on the State of the Art

Knowledge and data management is a key topic in Grid computing. Today and in the near future, data, information, and knowledge are critical elements in the application of Grids in several sectors of our society. As Grids become pervasive in human activities, the management of data and the derivation and manipulation of knowledge play an increasingly significant role in enabling high-level appli...

متن کامل

Grid - based Distributed Data Mining Systems , Algorithms and Services ∗

Distribution of data and computation allows for solving larger problems and execute applications that are distributed in nature. The Grid is a distributed computing infrastructure that enables coordinated resource sharing within dynamic organizations consisting of individuals, institutions, and resources. The Grid extends the distributed and parallel computing paradigms allowing resource negoti...

متن کامل

A comparative analysis of dynamic grids vs. virtual grids using the A3pviGrid framework

With the proliferation of Quad/Multi-core micro-processors in mainstream platforms such as desktops and workstations; a large number of unused CPU cycles can be utilized for running virtual machines (VMs) as dynamic nodes in distributed environments. Grid services and its service oriented business broker now termed cloud computing could deploy image based virtualization platforms enabling agent...

متن کامل

Distributed Anomaly Detection and Prevention for Virtual Platforms

An increasing number of applications are being hosted on cloud based platforms [69]. Cloud platforms are serving as a general computing facility and applications being hosted on these platforms range from simple multitier web applications to complex social networking, eCommerce and Big Data applications. High availability, performance and auto-scaling are key requirements of Cloud based applica...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004